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Abstract 

This article reviews the prevailing paradigm for how galaxies and larger struc- 
tures formed in the universe: gravitational instability. Basic observational facts are 
summarized to motivate the standard cosmological framework underlying most de- 
tailed investigations of structure formation. The observed universe approaches spa- 
tial uniformity on scales larger than about 10 26 cm. On these scales gravitational 
dynamics is almost linear and therefore relatively easy to relate to observations of 
large-scale structure. On smaller scales cosmic structure is complicated not only 
by nonlinear gravitational clustering but also by nonlinear nongravitational gas 
dynamical processes. The complexity of these phenomena makes galaxy formation 
one of the grand challenge problems of the physical sciences. No fully satisfactory 
theory can presently account in detail for the observed cosmic structure. However, 
as this article summarizes, significant progress has been made during the last few 
years. 
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1 Introduction: Basic Cosmological Facts and Prin- 
ciples 

If the universe began in a state of near-perfect homogeneity and isotropy, then how did 
it become so inhomogeneous on small scales? This is the puzzle facing cosmologists 
sorting through the fossil relics of the early universe: the cosmic microwave background 
radiation, the chemical elements, the mass both visible and invisible, and the complex 
patterns — galaxies, clusters and superclusters of galaxies, voids, filaments, and fluctu- 
ations — that organize these ingredients. In the following we shall examine these fossils 
and the models that are being used to try to explain their origin. 



Before getting to details let us size up the problem by considering the length scales 
under investigation. I shall use cgs units; those who prefer may translate to parsecs or 
light-years (1 pc = 3.09 x 10 18 cm = 3.26 lt-yr), SI units, or anything else. Starting with 
the familiar, we note that the solar system, defined by the major axis of Pluto's orbit, is 
almost 10 15 cm in extent. It lies a distance of 2 x 10 21 cm from the center of our Galaxy. 
Our Galaxy is part of the Local Group of galaxies; the nearest galaxy as large as our own 
is M31, the Andromeda nebula, at a distance of 2 x 10 24 cm. The Local Group is about 
5 x 10 25 cm from the Virgo Cluster, which lies at the center of the Local Supercluster of 
galaxies. 

Cosmic structures are arranged, almost hierarchically — like a fractal on small scales, 
although not large scales — up to a size not much larger than the Local Supercluster. For 
comparison, the radius of the presently observable universe is about 10 28 cm, i.e., 10 10 lt- 
yrs (assuming that the age of the universe is about 10 10 yrs). Astrophysical cosmologists 
seek to understand structure on scales from roughly 10 22 to 10 28 cm [[J3], |57[] . 

1.1 Five Observations about the Universe 

Models of structure formation must take into account basic empirical properties of the 
universe averaged over large scales. Five sets of empirical facts seem especially relevant: 



1. The isotropy of distant objects. A standard statistical measure of anisotropy 
(deviations from spherical symmetry around us) is the angular 2-point correlation 
function w(8), giving the relative excess number of pairs of objects separated by 
angle 9 compared with the mean number for a Poisson distribution. For 9 in the 
range of 1 to 3°, \w\ & 10~ 3 for faint radio sources (primarily distant galaxies and 
quasars) For 9 = 10°, w < 10~ 4 for X-ray sources (fig. 10 of ref. f64|), which 
are also primarily distant galaxies and quasars. The cosmic microwave background 
radiation, after subtraction of a dipole (cosine) variation over the whole sky, shows 
fluctuations of rms amplitude 1.1 x 10 -5 [ |108|| averaged over circular patches of size 
10°. 

2. Hubble's linear velocity-distance relation. The celebrated discovery of cosmic 
expansion was made by Hubble in 1929 ||62|| . Galaxies shine by starlight (with some 



emission from rarefied gas) and thereby display well-known spectral lines owing 
to radiative transitions between quantum states of abundant chemical elements. 
The wavelengths of these lines are found to be shifted relative to their laboratory 
values, generally to larger values, in rough proportion to the distance r: cAA/A = 
cz ps H r, where c is the speed of light and H is the Hubble constant, H = 
/i/(10 10 yr), with h = 0.75 ± 0.25. The linear relation is modified for distances 
so large that H r/c approaches 1. The Doppler interpretation of cosmological 
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Nuclide 


Solar 


Primordial 


L E 


0.73 


0.76 


2 H 


2.3 x 10~ 4 


~ 3 x 10~ 5 


3 He 


5.0 x 10~ 5 


~ 2 x lO" 5 


4 He 


0.25 


0.24 


7 Li 


6.5 x 10~ 9 


~ io- 10 


C,N,0 


0.017 


< 10" 4 



Table 1: Abundances (mass fractions) of the light nuclei. 



redshifts is well established: v = cz[l + 0(z)} is the recession speed. Of much 
greater difficulty are distance measurements fl98| ; until recently, the accuracy of 
relative distance measurements was limited to about 20%; absolute distances are 
even more uncertain. (This fact explains the large error bar on the dimensionless 
Hubble parameter h.) 



3. The cosmic microwave background radiation. In 1965, Penzias & Wilson 
PBJ discovered that the sky glows brightly and uniformly in the microwave at 



wavelengths of about 1 cm. In the more than 25 years since then, the spectrum 
(wavelength-dependence) and isotropy of this radiation have been found to match 
a blackbody (Planck spectrum) with temperature To = 2.73 K. The Cosmic Back- 
ground Explorer satellite (COBE) has placed limits on the deviations from the 
Planck spectrum of less than 3 x 10~ 4 relative to the peak intensity ||83 . COBE 
also was used to make the discovery of anisotropy cited above. 



4. The abundances of light nuclei. The most abundant nuclei in the universe 
are 1 H and 4 He. Abundances of the light nuclei are shown in Table using data 
compiled by refs. |l|, |j~5| . All heavier elements - - primarily carbon, nitrogen, 
and oxygen — are grouped together as "metals" by astronomers. The relative 
abundances of the metals — but not the light nuclei — can be explained by nuclear 
fusion ("nucleosynthesis" in the jargon of astrophysicists) occurring in stars and 
supernovae ||19|| . In older stars the mass fraction of 4 He is less than in the sun, but 
in no case is convincingly below 0.22. Nucleosynthesis in massive stars produces a 
much larger ratio of metals to He than 1:10, and stars effectively destroy 2 H and 
Li. (Deuterium is probably enhanced in the solar system by chemical fractionation, 
while some Li can be produced by cosmic ray spallation.) Thus, the light nuclei 
could not have been produced in stars. The only satisfactory explanation known 
was proposed by Gamow, Alpher and Herman in the late 1940s [f|7L [|: the light 
elements were produced by nucleosynthesis at relatively low temperatures ( ^ 10 9 
K) and high densities for a duration of several tens of seconds. 
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5. The existence of large amounts of dark matter. In the 1930s it was recog- 



nized by Zwicky [123] that galaxies in clusters move too rapidly for the clusters 



to remain bound, assuming that the galaxies weigh no more than the visible stars 



and gas they contain. In the 1980s, Rubin and others |99| discovered that stars 
and gas clouds in the outskirts of spiral galaxies also orbit too quickly around the 
center to be held in place by the gravity of the visible matter. The simplest expla- 
nation is that there is unseen mass present in galaxies and clusters. A wide range 
of evidence supports the conclusion that most (perhaps 90-99%) of the mass in 
the universe is much less luminous (per unit mass) than stars. Ordinary luminous 
matter dominates in the central regions of galaxies, with the dark matter forming 
an extended "halo" around the luminous parts. Unfortunately, little else is known 
about the dark matter. Three outstanding questions are: (1) What is the dark 
matter? (2) How much is there? (3) How is it distributed through space? Spec- 
ulative answers have been given to all of these questions, e.g., the dark matter is 
some new type of elementary particle, abundant enough to just close the universe, 
and it is distributed somewhat more uniformly than galaxies. However, this is 
speculation. The dark matter problem posed by these three questions is currently 
one of the most outstanding puzzles in all of science. 

1.2 Simple Cosmological Models 

The five sets of facts summarized above underly the cosmological models considered 
tenable by astrophysicists. In particular, the first two items, large-scale isotropy and 
the Hubble expansion, motivate the Cosmological Principle introduced by Einstein and 
Milne [jS4j| : The universe is approximately homogeneous and isotropic on large scales 
with a uniformly expanding mass distribution. 

Spatial homogeneity is difficult to establish because we cannot travel to a distant 
galaxy to see whether from there the universe looks similar to our vicinity. However, it 
is a natural extension of the Copernican principle, which asserts that our vantage point 
is not special. If the universe is isotropic (in the large) around every point, then it is 
necessarily homogeneous. The available data are consistent with large-scale homogeneity. 

Uniform expansion means that the galaxies separate with time with all distances 
scaling in proportion to a universal expansion scale factor a(t) where t is the proper time 
measured by observers in each galaxy. The galaxies themselves do not expand, nor do 
any other bound systems such as galaxy clusters (or, on a much smaller scale, the solar 
system). Actually, there are slight departures from perfectly uniform expansion even on 
large scales. Figure [I] illustrates the concept of perturbed Hubble expansion. 

There are several widespread misconceptions about the Hubble expansion. The first 
is associated with the question, "What is the universe expanding into?" It is not ex- 
panding into anything. The universe is all of space, so none is left to accommodate 
the expansion, nor is any more space necessary. Another misconception is that Hub- 
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Figure 1: Perturbed Hubble expansion. 
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ble expansion is a purely general relativistic effect due to the stretching of space. It is 
more accurate to say that galaxies are moving apart because they were set in motion by 
some initial mechanism; it then follows automatically that the distances between them 
increase with time. This Newtonian interpretation is valid for H r/c <C 1; general rela- 
tivistic models simply extend the expansion to the universe as a whole. Finally, there is 
misunderstanding about how the universe can have a finite volume — a possibility that 
cannot be excluded (but neither is favored) by observations. If the volume is finite, we 
expect the space to be compact (like an ordinary 2-sphere but with one more dimension) 
and not embedded in anything else. Practically all cosmological models — including the 
ones discussed in this paper — are founded on general relativity (or some modified metric 
theory), which gives a precise description of space, time, and cosmological expansion. 

Simple models of a homogeneous and isotropic, uniformly expanding universe were 
introduced during the period 1917-1940 by de Sitter, Friedman, Lemaitre, Robertson, 
Milne, and others.^ Although Einstein proposed the first cosmological model as a solution 
to his field equations of general relativity in 1917, he assumed, incorrectly in hindsight, 
that the universe was static. To force his field equations to yield a static (as opposed 
to expanding or contracting) solution he added an extra term called the cosmological 
constant. 

In all of these models except Einstein's, all separations between objects scale with 
time in proportion to the universal scale factor a(t). Thus, the position of each galaxy 
relative to some origin (in fact, any location, such as the Earth, may be taken as origin) 
may be written r = a(t)x, where x is a constant vector for that galaxy, called the 
comoving position. The Hubble law follows at once: v = df/dt = Hr where H(t) = 
dlna/dt. This result implies that H need not be independent of time; only its present 
value, H , is called the Hubble constant. 

We know that a(t) is presently increasing with time and will double in about 10 10 
years. Therefore it was smaller in the past, and may have been very small (possibly 
even zero) at some finite proper time in the past. In fact, points (iii) and (iv) of section 



1.1 support the notion that about 1.5 x 10 years ago, the expansion scale factor was 



very much smaller than it is now, and that the universe began expanding tremendously 
rapidly in an event that has come to be called the "big bang." The reasoning is simple: 
the temperature of an expanding gas (such as fills the universe) decreases adiabatically 
if there is no heat input. The heat content of the microwave background radiation is far 
too large to have been produced at low energies except in highly contrived models [94, ||. 
Therefore the mass and radiation in the universe must have been hotter and denser in 
the past. 

When the temperature was above 10 10 K, atomic nuclei were dissociated into protons 
and neutrons. The universe expanded and cooled rapidly, requiring only a few minutes to 
cool through the era of cosmic nucleosynthesis. About 3 x 10 5 years later the temperature 
dropped below 3500 K, the temperature at which hydrogen ionizes at cosmic density. The 



1 Many of the key papers appear, in English, in ref. |J. 
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hot gas filling space glowed bright red at this time with a Planck spectrum because the 
gas and radiation were in thermal equilibrium — the radiation was scattered, absorbed, 
and reemitted rapidly by the plasma. After this time the protons and electrons joined 
to form neutral hydrogen gas, which is almost completely transparent to radiation. At 
the present time we look out in distance, and therefore back in time, to see the radiation 
left over from this "recombination" era. Because of the large distance and relativistic 
recession velocity, the radiation is redshifted by a factor of about 1100, so that we detect 
it as the microwave background radiation. The big bang theory accurately predicts 
both the nuclear abundances and nearly perfect Planck spectrum and isotropy of the 
cosmic microwave background radiation in an approximately homogeneous and isotropic 
expanding universe. 

To my knowledge, no alternative to the big bang has been able to account for points 
(i)-(iv) in section \L.1[ Therefore I shall assume the basic scenario outlined in the previous 
paragraph. This does not require me to ask what preceded or caused the big bang. Those 
remain metaphysical questions in the absence of any empirical data. 

The basic laws of a uniformly expanding universe obeying general relativity theory 
were first set out by Friedman in 1922 |46|]. Just as one would expect from Newtonian 



ideas, the cosmic expansion decelerates due to gravity. Consider a cosmological model 
with uniform mass density p(t) decreasing with time owing to the expansion. (The 
equivalence of mass and energy implies that p includes all forms of mass and energy. 
For nonrelativistic matter p oc a~ 3 from mass conservation, while for a relativistic gas 
p oc a -4 because the energy decreases due to the work done by pressure during the 
expansion.) The expansion rate H = d In a/dt obeys the Friedman equation 

rr2/ . 8ttG K 
H 2 {t) = —p 



a 2 (t) ' 

where G is Newton's constant and K is the cosmic curvature constant, related to the 
curvature of three-dimensional hypersurfaces of constant time in spatially homogeneous 
and isotropic (or Robertson- Walker) models. Euclidean space has K = 0, while a closed 
(compact) universe with finite volume (a three-sphere) has K > and an open universe 
(a three- hyperboloid) has K < 0. 

The Friedman equation relates the geometry of space to the mean mass density. By 
combining H and Gp we can define a dimensionless density parameter Q: 

where to refers to the present. The Friedman equation now reads K = (Q — l)(aH) 2 , 
showing that the spatial curvature depends on the mean density of matter. The mean 
density also determines the evolution of a(t). For a gas with pressure p = ape 2 with 
a > — | (e.g., a = +| for a relativistic gas of photons while a ~ for nonrelativistic 
matter), the Friedman equation may be integrated to show that the universe will continue 
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Figure 2: Cosmic expansion factor in a zero-pressure Friedman model. 



expanding forever if K < while it will eventually cease expanding and will recollapse 
if K > 0. Moreover, all of these models begin with a singularity a = at some finite 
time in the past. Figure |2| illustrates the solutions for a matter-dominated universe with 
p pc 2 . 

In past decades, astrophysicists took the Friedman models rather literally, and cos- 
mology was widely regarded as a search for two numbers, H and Q = £l(t ). However, 
that time is past. We recognize that the universe need not be homogeneous and isotropic 
on scales much larger than the 10 10 lt-yrs we can see (in principle) today and that there 
is no way to know if the Friedman models are globally correct. These questions, like the 
one of what preceded the big bang, are metaphysical. All that we know about the global 
spacetime geometry is that within our observable patch the universe looks, to a good 
approximation, like a Friedman model with 0.2 £ Qq £ 2. Some observational cosmol- 
ogists may go further and narrow the range of Qq, or advocate a nonzero cosmo logical 
constant A, which adds a term A/3 to the right-hand side of the Friedman equation. 



However, there is no consensus on these issues ||102| , |22 . 

Interest in the value of Qq remains strong for several reasons. First, during the last 15 
years astrophysicists have realized increasingly that large amounts of dark matter exist 
in halos surrounding galaxies, increasing estimates of Qq beyond those from the visible 
matter. Second, certain theories of the early universe make predictions for the value 
of Qq, in particular, the inflation theory of Guth f56|. According to this theory, which 
supplements the standard big bang model, at very early times the cosmic expansion was 
accelerated tremendously by a temporary large cosmological constant, causing a(t) to 
increase exponentially so that the curvature term in eq. ([!]) became strongly suppressed 
relative to the other terms. As a result, spatial curvature should be negligible today so 
that Qq — 1. (Any remaining cosmological constant term can be included in p as a con- 
stant "vacuum" energy density.) The inflation theory is attractive because it can explain 
the large-scale homogeneity and isotropy of the universe, but it remains speculative. 

Direct observation of stars, with a reasonable extrapolation for faint and burned out 
stars, yields only Q £ 0.01, far less than predicted by inflation. For cosmic nucleosynthe- 
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Density [g cm 3 ] 


Object 


Size 


1 

io- 10 

1Q -24 
10 -29_ 10 -30 


Earth 
Solar system 
Galaxy 
Universe 


F7 

10 cm 
10 15 cm 
10 22 cm 
10 28 cm 



Table 2: Characteristic mass density for objects of increasing size. 



sis to produce the observed abundances of the light nuclei, the abundance of "baryonic" 
matter (made of protons, neutrons, and electrons) is relatively tightly constrained [|116|| : 
VL-qK 1 = 0.0125 ± 0.0025. From the gravitating mass in galaxies and clusters of galaxies 



one infers f2 ~ 0.2 [92, [L13||, implying the existence of much more dark than luminous 



matter. However, these are all lower limits to the global mean value of f2 , because dark 
matter may exist between galaxies and clusters. A recent large-scale measurement of the 
gravity field implies f2 > 0.3 |36| . 

Dark matter is a complication of simple big bang cosmology. Its nature is important 
for galaxy formation but only its abundance is important for homogeneous cosmology. 
Many searches for dark matter are presently underway [pBfl , but it is possible that most 
of the dark matter interacts so weakly with ordinary matter and radiation as to be 
undetectable aside from gravitational effects. An alternative way to investigate dark 
matter is to test specific theories of dark matter for their predictions for the formation of 
galaxies and large-scale structure, which are sensitive to the gravitational clustering of 
dark matter. Before doing that, we shall discuss another important and possibly related 
fact about the universe: It is not perfectly homogeneous and isotropic. 



1.3 The Perturbed Universe 

Like Darwin's theory of the evolution of species and Alfred Wegener's theory of conti- 
nental drift, the big bang theory is only a starting point for more detailed models of 
the universe. It is a framework that requires additional ideas for a complete and consis- 
tent physical cosmological theory. Chief among the missing ideas are those relating to 
departures from strict homogeneity and isotropy. 

The universe is extremely inhomogeneous: the density of this paper (or the computer 
screen displaying these characters) is about 30 orders of magnitude denser than inter- 
galactic space. This fact is not necessarily incompatible with the Cosmological Principle 
stated in section |1.2| , because the inhomogeneity is much less when measured on larger 
scales (see Table ||). Nevertheless, it begs the question: How did these inhomogeneities 
develop? 

Among the four known fundamental forces, gravity would seem, a priori, to be the 
most likely agent responsible for the formation of cosmic structure. After all, we know 
that gravity holds together the Earth and the solar system, and that the purely attractive 
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and long-range nature of gravity cause it to be more important on large scales than the 
other forces (electromagnetic, strong, and weak nuclear forces). 

To test whether gravity might create structure it is useful to consider the evolution of 
small-amplitude perturbations of a homogeneous medium. As in plasma physics, optics, 
and other disciplines, we consider the propagation of waves in a uniform background, 
with relative density fluctuation 5p/ p oc exp[i(k -x — ut)]. For simplicity we shall neglect 
the expansion of the universe, which is a good approximation over a short period of time 
if uo 2 ^> H 2 . Linearizing the equations of motion for the matter (here, approximated by 
the perfect fluid equations), one obtains the dispersion relation 
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~\ 1/2 



k 2 c 2 s -AnGp = c 2 s (k 2 -kj) , kj={— . (3) 



This result is similar to the dispersion relation for high-frequency electromagnetic waves 
in a plasma, where the sound speed c s is replaced by the speed of light c and the term 
—4irGp is replaced by the square of the plasma frequency, u 2 = +4nn e e 2 / m e for elec- 
trons of charge e, mass m e , and number density n e . Gravity differs from electricity in 
two essential ways. First, the gravitational "charge-to- mass" ratio (gravitational mass 
divided by inertial mass) is 1 for all objects as was discovered by Galileo almost 400 
years ago. Second, gravity has the opposite sign: all masses attract each other. The sign 
difference is crucial: it leads to gravitational instability of long-wavelength fluctuations, 
as was first pointed out by Jeans nearly a century ago |66|]. When k < kj (kj is known 



as the Jeans wavenumber), uo 2 < so that one of the two roots of the dispersion relation 
corresponds to exponential growth. 

In reality, the growth is exponential only for a static medium with H = and 
p = constant. In an expanding universe p decreases with time and therefore so does the 
growth rate. In this case the linear growth of perturbations is proportional to a power 
of t rather than being exponential. 

But how do we know this instability is physical? After all, the other imaginary root 
of the dispersion relation for k < kj leads to damping; perhaps the growing solution 
should be discarded. 

Gravitational instability occurs because because it is energetically favorable for per- 
turbations to grow in amplitude. For example, dense regions gain negative gravitational 
energy by collapsing, more than compensating for the increase in positive kinetic energy. 
Another way to view this process is that overdense regions in an Q = 1 universe are 
like small portions of closed (Q > 1) universes; therefore they expand less rapidly than 
their surroundings and eventually collapse (fig. §). Conversely, underdense regions are 
like small portions of open (Q < 1) universes which expand more rapidly. The result is 
that matter is transferred from underdense to overdense regions. Figure |3] illustrates this 
process in a small cosmological N-body simulation beginning from a slightly perturbed 
zero-pressure Friedman model. The evolution of one two-dimensional layer of particles 
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is shown; the total simulation used 16 3 particles and integrated Newton's laws in three 
dimensions with periodic boundary conditions. 

Granted that gravity leads to a linear instability — but perhaps this instability satu- 
rates at moderate amplitude like so many others do in the nonlinear regime. This is not 
so. In fact, gravitational instability strengthens with collapse because the gravitational 
energy of a given mass, roughly —GM 2 /R, diverges as R — > 0. The instability ceases 
only after a gravitationally bound object forms with enough internal kinetic energy to 
support itself against further gravitational collapse, as in the case of the Earth orbiting 
the sun or the gas in the sun itself. Bertschinger & Jain |12| have recently proven that 
gravitational instability in a cold, initially homogeneous medium inevitably drives mass 
elements with density exceeding the critical density to collapse to arbitrarily high density, 
until pressure, vorticity, or other non-gravitational forces inhibit further collapse. 

Dissipative processes drive self-gravitating systems to still higher density. Self-gravitating 
systems are peculiar in that they have negative specific heat and therefore no stable ther- 
mal equilibrium state in general. This behavior follows from the classical virial theorem 
|5^1 , which relates the kinetic energy K and the gravitational potential energy W of an 
equilibrium self-gravitating system: 2K + W = K + E = where E is the total energy. 
The specific heat is thus dE/dK = — 1. If energy is removed from the system (by atomic 
radiative processes, for example), E decreases but K increases; the system shrinks and 
gains twice as much gravitational binding energy as it loses to radiation, requiring the 
kinetic energy to increase to maintain equilibrium. 



2 Large-Scale Structure: Is Gravity Responsible? 

In the preceding section we have outlined the main ingredients needed to investigate 
the formation of cosmic structure. Now we consider the evidence for and implications 
of cosmic large-scale structure, defined by the distribution of galaxies and dark matter 
on scales larger than about 10 26 cm. Averaged over these scales, the number density of 
galaxies (and, we believe, the net mass density) is relatively smoothly varying, with fluc- 
tuations \Sp/p\ £ 1. On these scales gravitational instability is therefore still relatively 
mild so that we can hope to infer the primeval fluctuations and test specific theories for 
their origin and evolution. The main question we address is this: Is gravity responsible 
for the formation of large-scale structure? 

In gravitational instability models the structure of spacetime is perturbed by small- 
amplitude fluctuations in the gravitational potential <f)(x,t). The description of this 
system in general relativity is not difficult. The spacetime geometry is given by the line 
element of a perturbed Robert son- Walker model: 

ds 2 = -(1 + 2<p/c 2 )c 2 dt 2 + (1 - 2(p/c 2 )a 2 (t)(dx 2 1 + dx\ + dx\) . (4) 

[We are assuming (0/c 2 ) 2 <C 1, the background spatial curvature K is negligible on 
scales of interest, and gravitational radiation and other purely relativistic effects are 
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Figure 3: Gravitational instability in an expanding universe. 
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unimportant. These are very good approximations for the problem at hand.] In general 
relativity, as in special relativity, time and space are not independent; invariant distances 
can be defined only by combining all four coordinates. However, we do not use the same 
coordinate system as in special relativity. Besides including the gravitational potential in 
the line element, we factor out the cosmic expansion scale factor a(t) from our coordinates 
Xi — they are the comoving coordinates introduced in section |1.2| . 

The presence of the gravitational potential (f) in eq. (f|) represents the variations in 
the spacetime geometry caused by density fluctuations. (In general relativity theory, 
gravitational "forces" are caused by variations in the spacetime curvature. Freely falling 
particles follow geodesies - the "straightest" possible curves in a curved spacetime.) The 
perturbed Einstein field equations imply a cosmological version of the classical Poisson 
equation of Newtonian gravity (assuming nonrelativistic sources and length scales small 
compared with the Hubble distance c/H): 



a- 2 V 2 4> = 4nGp[ -^\ . (5) 




The factor a~ 2 is necessary because the spatial Laplacian is with respect to the comoving 
coordinates. The chief difference from Newton's version is that the source is not the 
total mass density p; rather it is the density fluctuation 5p = p — p. This difference is 
insignificant within the solar system but is very important in cosmology. If the source 
were p then <p would diverge in an infinite universe and the gravity field would not be 
well-defined. Newton recognized this problem but its solution had to wait for Einstein. 
If the density field approaches homogeneity on large scales, 5p has vanishing spatial 
average and <ft is convergent. A rigorous proof requires full general relativity theory, but 
the result stated here is correct for the types of perturbations of Friedman models under 
consideration. 

Note that Sp/p may be large yet </>/c 2 small. For a mass fluctuation of proper wave- 
length A, the solution of eq. (|5|) has characteristic amplitude 



A \ 2 dp ( A \ 2 dp 



-kc/Hq) p Yl0 28 cmy p 



(6) 



Thus, mildly nonlinear structures of size 10 26 cm do not produce large potential fluctua- 
tions. This is fortunate, because otherwise the Einstein equations would be much more 
complicated than eq. (Q) and large-scale dynamics would be harder to relate to initial 
conditions. 

In many models for the formation of large-scale structure, gravitational potential 
fluctuations were generated by some physical process occurring in the early universe. 
For example, in the inflation scenario, quantum fluctuations in the field driving the 
inflationary expansion lead to large-scale density fluctuations. Without some source of 
potential fluctuations it is difficult to understand how large-scale structure could have 
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formed. Thus, we shall examine the consequences of assuming a nonzero <f)(x, U) as our 
initial conditions for structure formation. At this stage we are not concerned with what 
produced these fluctuations; the first question is whether they can produce the large-scale 
structure. 

Primeval potential fluctuations have three major observable effects: (1) They produce 
microwave background anisotropy; (2) They imply inhomogeneities in the distribution 
of mass; and (3) They induce nonzero velocities relative to the uniform Hubble flow. 
Observations of microwave background anisotropy, galaxy clustering, and galaxy mo- 
tions therefore can be combined to test the consistency of gravitational amplification of 
primeval fluctuations. 



2.1 Cosmic Microwave Background Anisotropy 

A very important step in testing of the gravitational instability paradigm follows from the 
exciting discovery last year of anisotropy in the cosmic microwave background radiation 
by Smoot et al. ||108| using the COBE satellite. How is this anisotropy related to the 



primeval potential fluctuations? 

Photons traveling to us from the recombination layer (the cosmic photosphere occur- 
ring when the temperature dropped below 3500 K) did work against gravity in climbing 
out of the gravitational potential minima. Consequently, the microwave background 
temperature should be slightly smaller in the direction of potential minima on this pho- 
tosphere compared with its average value and higher toward potential maxima. The 
magnitude of this gravitational redshift effect in terms of the radiation brightness tem- 
perature is AT/T = A0/c 2 . However, it is partially offset by the fact that the density 
is higher in the potential minima from the Poisson equation, so the temperature is also 
higher and recombination occurred there a little later than elsewhere. As a result these 
photons have suffered less cosmic Doppler shift in traveling toward us. The net anisotropy 



IS 



A2\_ 1 



(n) = — A0(f,t r ) , (7) 
T 3c 2 

where x lies at the cosmic photosphere (nearly at the edge of the presently observable 
universe) in direction n, and t T is the time of recombination. This simple formula is 
valid for isentropic (constant entropy) fluctuations on sufficiently large scales (larger 
than about one angular degree) so that acoustic waves in the coupled photon-baryon 
fluid have not modified the temperature. The simplest models of fluctuation generation 
predict isentropic fluctuations, with constant ratios of the fluctuations for all components 
- photons, baryons, neutrinos, etc. The microwave background anisotropy generally is 
larger if these ratios vary. Eq. (|7j) also assumes that the gravitational potential does not 
evolve in time after recombination. 

Figure ^ shows the angular correlation function of the cosmic microwave background 
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Figure 4: The angular correlation function of cosmic microwave background temperature 
anisotropy. Points with error bars are taken from [119]. Dotted curves show three Monte 
Carlo samples for a scale-invariant primeval density fluctuation spectrum including real- 
istic sampling and noise. The mean correlation function for this model, convolved with 
the COBE beam and sampled like the observations, is drawn as the smooth heavy curve. 



temperature, denoted here by C{6) instead of the w(9) referred to earlier, defined by 

= (AT^) AT(n 2 )) Hi . H2=cosd , (8) 

where the angle brackets denote an average over all pairs of directions separated by angle 
6. Also shown in fig. [| are several Monte Carlo simulations characteristic of a scale- 
invariant spectrum of primeval potential fluctuations || |105|| , as predicted by the simplest 



inflationary cosmology theories. Although the statistical fluctuations due to receiver 
noise as well as to finite sample size are appreciable, the detected signal is statistically 
highly significant and indicates the presence of very long- wavelength fluctuations in the 
universe. 

The COBE anisotropy measurements are important for showing that large-scale po- 
tential perturbations indeed exist, as required by gravitational instability models for 
cosmic structure formation. However, they have an important limitation: the COBE 
measurements probe only very large scales. At cosmological distances, an angle of 10° 
(approximately the minimum scale probed by COBE) corresponds to a linear size of 1.05 
(Qoh)^ 1 comoving Gpc (or 4 x 10 27 cm), more than seven times larger than the biggest 
known galaxy superclusters (e.g., the "great wall" of galaxies, cf. ref. ||50||). COBE alone 
cannot test structure formation theories. Other measures of structure are needed. More- 
over, these measures must allow us to relate the amplitude of fluctuations on different 
size scales. 
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2.2 Gravitational Potential Fluctuations 



In all models, the primeval potential field is a stochastic quantity whose statistical prop- 
erties, but not its actual value, may be predicted a priori Assuming that the potential 
has a well-defined mean value (which we may take to be zero without loss of general- 
ity), the most important simple statistic is its two-point correlation function. In order 
to express the scale-dependence of the potential it is most convenient to work in the 
Fourier transform space such that fluctuations are expanded in plane waves exp(ik ■ x). 
(If K 7^ the background space is non-Euclidean and plane waves must be replaced by 
the appropriate eigenfunctions of the spatial Laplacian.) Note that here we take x to 
be the comoving position defined in section |1.2| ; therefore k is the comoving wavevector. 
The two-point correlation function in Fourier space is given by the power spectral density 
(power spectrum) P^k.t), defined so that the variance of <p is 



Here the angle brackets denote either an average over an ensemble of universes or a 
spatial average for a single universe; they are equivalent for ergodic processes including 
all widely studied cosmic fluctuation models. Note that the variance is independent of 
position because we assume the potential to be a homogeneous random process, and 
the power spectral density is independent of the direction of the wavevector because we 
assume the potential to be statistically isotropic. In making these assumptions we are 
restricting the class of models for the perturbations. 

The power spectral density is a useful quantity for comparing the amplitudes of fluc- 
tuations on different size scales. It is common in cosmology to represent the fluctuations 
in terms of Sp/p, but we have seen above that the gravitational potential is more natural 
because of its simple relation to spacetime curvature fluctuations (eq. f|) and to cosmic 
microwave background fluctuations (eq. |^). Another reason for preferring <p is that its 
time derivative (at fixed comoving position) vanishes in a matter- or radiation-dominated 
universe for wavelengths exceeding the distance sound waves can travel in the age of the 
universe. (Acoustic waves produce damped oscillations of the potential.) Given the 
potential, it is easy to get the density fluctuation from the Poisson eq. (|5|). 

In place of the power spectral density we introduce a quantity called the potential 
amplitude function ||: 

A(k,t) = c- 2 [k 3 P (P (k,t)f 2 . (10) 

This quantity measures the root-mean-square potential fluctuation (divided by c 2 ) on 
scale k , it is dimensionless (usually we set c = 1 anyway). The potential amplitude 
function has two advantages over the spectral density. First, it more naturally indicates 
the amplitude of fluctuations as a function of scale. Second, it is constant for a scale- 
invariant spectrum. A scale-invariant spectrum is defined to be one such that A(k,t) 
is independent of k and is a good approximation to the simplest inflationary models of 




(9) 
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the early universe, which predict only a weak (logarithmic) dependence on k. Often 



such a spectrum, also known as the Harrison-Zel'dovich-Peebles spectrum |122| , [95 
is described by saying that the power spectral density of Sp/p is proportional to k. 
Complicated arguments about the growth of perturbations are cited to explain why 
P$ oc k is "scale-invariant." It is much simpler to say that the potential fluctuations 
have the same amplitude on all scales so that P^ oc A; -3 , and then to note that the 
Poisson equation implies P$ oc k^P^ oc k. 

2.3 Galaxy Redshifts and Peculiar Velocities 

As noted above, the anisotropy of the cosmic microwave background radiation has been 
measured on scales larger than galaxy superclusters. To probe smaller scales cosmologists 
use the galaxies themselves as tracers of the density and velocity fields. 

Under the assumption that galaxies are distributed like mass on large scales ("light 
traces mass" ) , the galaxy number density distribution n g (x ) is related to the mass density 
distribution. This relationship is complicated in several ways. For example, we cannot 
measure accurately the three-dimensional distribution of galaxies because distances are 
difficult to measure and are highly uncertain. Because of the difficulty of obtaining 
accurate extragalactic distances, astronomers generally settle for redshifts and assume 
them to be related approximately to distances by the Hubble law. 

Catalogs of redshifts and angular positions are called redshift surveys. With the 
advent about a decade ago of high quantum-efficiency detectors, galaxy redshift surveys 
have grown by a large factor and now encompass about 10 5 galaxies. Figure |5| shows part 



of the well-known second Center for Astrophysics redshift survey |3£| |50], |63[ showing 
evidence for a bubble-like topology of galaxy clustering. 



Large redshift surveys have been conducted and analyzed by many groups |1| |103 



110| , 115 , 78 , 44, 104 1; see ref. [51] for a review of earlier work. Smoothed over the 
intergalactic spacing, the surveys provide a measure of the density fluctuation field 5n g /n g 
whose spectral density may be compared with theoretical models. A simple formula 
fitting most of the results very well was proposed by Peacock [58|; in terms of the 
potential amplitude function this formula is 

The factor Q is included because the potential is proportional to the mean mass density. 
Ref. U discusses the estimates of the parameters in eq. flnp. A good fit over the range 
0.02 < k Mpc/h < 1.0 is provided by A = 1.27 x 10" 5 , 7 = 2.4, and k c = 0.024 h Mpc" 1 . 

One should be cautious in extrapolating the results of galaxy surveys to the mass 
distribution as a whole because there is no proof that dark matter is distributed like 
galaxies. Indeed, it is known that dark matter is less concentrated toward the centers of 
galaxies than luminous matter. It is also plausible that averaged over larger scales galaxy 
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Figure 5: A slice of the universe: galaxies from the second Center for Astrophysics 
redshift survey [63]. All galaxies whose flux exceeds a certain limit in a wedge of the 
sky of area 0.21 steradians are plotted with one angular coordinate (right ascension) and 
radial velocity. 



number density fluctuations are an amplified version of the mass density fluctuations due 
to a process called biased galaxy formation: 6n g /n g = b5p/p. This relation, suggested by 
Kaiser on the basis of the statistical properties of peaks of a gaussian random field |B7J] , is 
highly schematic; because galaxies are effectively a point process while the mass density is 
effectively continuous, the relationship only makes sense for the smoothed fields. Possible 
but speculative physical mechanisms leading to this bias have been suggested |37] [17]] . 
The "biasing factor" b may also be a function of smoothing scale and position, as well 
as galaxy type. However, in the absence of a detailed theory of galaxy formation having 
real predictive power, it is reasonable to use such phenomenological models to interpret 
galaxy redshift surveys. 

The second probe of potential fluctuations suffers little from uncertainty about how 
galaxies trace mass. This method is based on the so-called "peculiar" velocities of galaxies 
- their residuals from the Hubble flow |20[| . We noted before that the proper position 
is r = a(t)x and the Hubble velocity is Hr = (da/dt)x. The peculiar velocity is then 
v = df/dt — Hr = adx/dt. The key idea is that galaxies acquire their peculiar velocities 
as test-bodies falling in the large-scale perturbed gravitational field. If peculiar velocities 
initially are small, so that objects fall from rest — relative to the background cosmic 
expansion! - the peculiar velocities initially grow in proportion to the gravitational 
field: v oc g = — a -1 V0. The constant of proportionality is not exactly equal to the 
cosmic time, because the gravity field may change with time (as well as varying in space). 
Assuming galaxies have not moved far (in comoving coordinates) so that a linear relation 
still applies, the constant of proportionality still depends on Q. Taking the divergence of 
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this velocity-gravity relation and using the Poisson equation, we can estimate the mass 
density fluctuations giving rise to the peculiar velocities [^]: 



(12) 

The factor f2 6 approximately expresses the ^-dependence of the peculiar velocities for a 
wide range of cosmologies 0, [76]]. Eq. fll2[) is valid only if \Sp/p\ ^ 1, but modifications 
work well in the quasi-linear regime (5p/p ^ 5) P5| |53| . The most important points 
are that galaxies provide an unbiased tracer of the gravity field because all bodies fall 
the same way in a gravitational field (Galileo), and the inferred density fluctuation field 
applies to all the mass and not just the galaxies. Thus, peculiar velocities offer an 
excellent means for testing theories of large-scale structure. 

These ideas have been used by my collaborators and I to reconstruct the large-scale 
mass density fluctuation field || |35], [K| by applying a method called POTENT to a 



large sample of estimated galaxy distances |7S|, f£|. Recently we have compared the 
mass density fluctuations from POTENT with the galaxy density fluctuations from a 
complete redshift survey of galaxies selected from the infrared survey made by the IRAS 
satellite IP-lOfl . Figure |6| shows the results in one plane through the local universe |36 |. 



The results of this comparison show consistency within the measurement uncertainties, 
which are dominated by galaxy distance errors, provided f2 ~ 0.3. Unless galaxies are 
much less clustered than the dark matter (the opposite of what is usually assumed), our 
results strongly exclude low-density models, even spatially flat ones in which most of 
the energy density is in the form of a cosmological constant. Similar conclusions were 
reached in an earlier comparison of the radial velocity and gravity fields |69| . 

The velocity- density comparison relies on galaxy distance measurements (for the pe- 
culiar velocity) that may be prone to systematic errors in addition to the large statistical 
uncertainties. Large-scale measurements of f2 also can be made using galaxy redshift 
surveys alone, assuming that the spatial clustering is statistically isotropic and that the 
observed radial distortions are due to peculiar velocities f68|, [58]. Present results from 



this method have large observational uncertainties |59fl ) but the size of galaxy redshift 
surveys will increase ten-fold by the end of the decade, enabling a powerful comparison 
to be made of different large-scale dynamical measurements of Qq. 



2.4 Can Gravity Account for Cosmic Structure? 

In the preceding sections we described three ways to estimate the gravitational potential 
field: cosmic microwave background fluctuations, galaxy redshift surveys, and gravity 
field measurements. By combining estimates of the potential amplitude function made 
using these different techniques we can test whether the available data are consistent 
with one mechanism for inducing large-scale structure. 
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Figure 6: Contours of smoothed density for galaxies (left) and mass (right) in the plane of 
the Local Supercluster of galaxies [36]. Contours are spaced by 0.1 in 5p/p, with positive 
values solid, negative values dashed, and the zero contour slightly thicker. Outside 
the heavy solid line the standard error of the mass density fluctuation exceeds 0.2, while 
outside the heavy dashed line the sampling of galaxies is too sparse for reliable estimation 
of the mass density. 
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Figure 7: The potential amplitude function from measurements of cosmic microwave 
background anisotropy (COBE band), galaxy redshift surveys (dashed-dotted curve 
marked IRAS, CfA galaxies; its extrapolation to longer wavelengths is indicated as a 
dotted line), and peculiar velocities (crosses marked POTENT) [7]. Scaling density and 
velocity to potential requires assuming Q ; two different choices are shown in parts (a) 
and (b). Solid curves going through the COBE band is the linear theory prediction for 
flat cold dark matter (CDM) models normalized to COBE. The dashed curve shows the 
CDM models smoothed the same way as POTENT. 



Figure |7j shows the present-day potential amplitude function inferred from the mea- 
surements described above ||. The COBE points depend on Q because if Q ^ 
1, the gravitational potential fluctuations change with time, increasing the microwave 
anisotropy for a given potential amplitude |53| and therefore requiring smaller-amplitude 
potential fluctuations today for a given measured anisotropy. The estimates based on 
galaxy number density and peculiar velocities also depend on Q . The spectrum esti- 
mates from peculiar velocities are not corrected for noise power. Seljak & Bertschinger 
|106|| recently have carefully analyzed the peculiar velocity data and showed that they are 
compatible with the cold dark matter model (discussed in the next section) for Qq — 1 
but not for Q = 0.2, just as one would infer from fig. |7|. 

Remarkably, the various measurements of the gravitational potential amplitude are 
in rough agreement with each other and with the cold dark matter theory over more 
than three decades of spatial frequency. Given the difficulty of making these measure- 
ments, agreement to within a factor of two is tremendously reassuring. If gravity were 
not responsible for cosmic structure, there is no a priori reason one should expect any 
agreement at all. 
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3 Small-scale Structure: How did Galaxies Form? 



Even if gravity alone is responsible for cosmic structure formation, one may fear that 
nonlinear evolution is so complicated that it is impossible to say anything about the 
initial conditions from the present-day distribution of mass. As we saw in the previous 
section, on large scales (i.e., when one spatially averages the mass) this assessment is too 
pessimistic. When smoothed sufficiently the density fluctuations have small amplitude 
and they evolve in a simple way. Consequently, the relation between the initial (i.e., 
immediately after recombination) and present density, velocity and potential fields is 
approximately linear on large scales. Eqs. (0) and fll2|) reflect this linearity. 

However, on small scales cosmic gravitational dynamics is strongly nonlinear and 
chaotic. The Lyapunov time for individual particle trajectories is about one orbital 
time in the dense bound clumps that form by gravitational instability [fToll - Even if we 
could specify the position and velocity of every particle in the universe today, the strong 
chaos of gravitational dynamics would make it impossible to integrate the trajectories 
backwards in time. Rather than go backwards, therefore, cosmologists try different 
theories for the initial conditions and integrate forward in time using gravitational N- 



body simulations [O] in which each particle represents a cloud of dark matter. More 



recently some workers have also begun to include gas dynamics in their simulations in 
order to follow the baryonic matter — a trace contaminant in many theories, but all that 
we can see! 

Galaxy formation is very difficult to simulate for two reasons. The first one is the large 
required dynamic range in length and mass scales (cf. Table ^). Individual luminous 
galaxies are smaller than 10 23 cm but they reside in structures thousands of times larger. 
Realistic simulations would require a dynamic range of at least 10 4 in length and 10 9 
in mass. Simulations of this size are beyond the current state-of-the-art, although they 
should be possible within five years. 

The second difficulty is the complexity of gas dynamical processes, which only exacer- 
bate the dynamic range problems. Impressive calculations of cosmological gas dynamics 



have recently been made (refs. [|120| , [71 , [72] , |100| , |26| , [42], and references therein), but the 



greater computational cost (and more severe timestep restriction) of gas dynamics rela- 
tive to gravitational simulation has severely limited the dynamic range in mass and/or 
length. Even when these problems are ameliorated by much more powerful computers, 
the complexity of radiative processes, magnetic fields, etc., will continue to challenge 
the computational astrophysicist. Star formation — which is known to be important 
in determining the appearance and evolution of galaxies — can be treated only in a 
phenomenological manner at best. 

Is galaxy formation so complicated then as to defeat our attempts to test theories of 
cosmic structure formation? Probably not, for several reasons. First, because all theories 
of the initial conditions are stochastic, it is unnecessary for calculations to correctly 
reproduce every detail of the evolution beginning from a specific state. In effect, we 
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demand only that the coarse-grained distribution of mass, averaged over galactic or 
larger scales, have the correct statistical properties. Second, it is plausible that gas 
dynamical effects are important only within galaxies and that galaxies form at minima 
of the gravitational potential field of the dark matter. (Gas dynamical simulations at 
least should be able to test this hypothesis.) If so, then gravity is sufficient to identify 
the sites of galaxy formation, if not the internal properties of the galaxies themselves. 
Third, analytical arguments suggest and numerical simulations confirm several simple 
scaling relations for the evolution of self-gravitating dark matter, allowing us to extend 
the dynamic range of simulations in a statistical sense. 

In the following sections we shall present these scaling relations, followed by a case 
study of a particular structure formation model, the standard cold dark matter model. 
As we shall see, that model appears to make predictions at variance with observations, 
leading cosmologists to explore alternatives. 

3.1 Hierarchical Clustering 

Let us begin with visual examination of the mass distributions produced by strongly 
nonlinear gravitational instability. Figure [8] illustrates gravitational clustering beginning 
from white noise perturbations of a homogeneous and uniformly expanding = 1 uni- 
verse: 128 3 particles were simply placed uniformly and independently at random in a 
cube with zero peculiar velocities at the initial time. Unlike fig. |3|, comoving coordinates 
are used here to factor out the mean cosmic expansion. The top three and bottom left 
two panels show the time evolution of the entire volume while the last panel is a ten 
times magnification of the lower right-hand corner of the last output. The largest clump 
in the bottom right panel contains about 7500 particles. The spatial resolution is about 
10~ 3 of the simulation size and the entire simulation consumed about 100 Cray Y-MP 
hours for 630 timesteps. 

Poisson initial conditions are not believed to be realistic for our universe. For our 
purposes, however, this N-body simulation provides an excellent illustration of the pro- 
cess of hierarchical clustering. In this process, mass is gathered by gravity into dense 
clumps, which merge successively to form larger clumps. The universe remains homo- 
geneous on the largest scales, but the transition length scale between homogeneity and 
strong clustering — called the clustering length £ c (t) — increases with time in comoving 
coordinates. 

It is straightforward to understand the increase in the clustering length by extrapo- 
lating linear theory with a simple model of nonlinear effects. In fig. |3] we see that the 
mass in an overdense region expands until the relative density contrast with its surround- 
ings becomes of order unity. At that point the mass collapses to form a gravitationally 
bound system that ceases expanding with the universe. At any given time, therefore, 
the clustering length is roughly the length scale on which the rms density fluctuation 
is about unity. Since linear theory is obeyed to a reasonable approximation until the 
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Figure 8: A mosaic showing time evolution of the projected mass distribution in an 
Q = 1 universe with particles initially distributed as a Poisson process. The first five 
panels show the projection of all the mass (colors represent logarithm of projected mass 
density) at expansion factors 8, 23, 64, 125, and 250 after the start of the simulation. 
The bottom right panel is a ten times magnification of the lower right-hand corner of 
the last output. 
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density contrast equals unity, we may estimate the clustering length using linear theory. 
This is most naturally done using Fourier analysis to decompose the density field into its 
spatial frequency components. The rms density fluctuation on comoving scale i = k" 1 is, 
by analogy with eq. (|10|), [k 3 Pg(k, t)} 1 ^ 2 . Setting this to unity gives an implicit equation 
for l~ x = k c {t). 

In an Q = 1 universe, as a result of gravitational instability small-amplitude (linear) 
density fluctuations evolve in proportion with the expansion scale factor: Sp/p oc a(t) 
with fixed spatial dependence in comoving coordinates |M| |7[]. (If Q < 1 the growth is 
less rapid, while if Q > 1 it is more rapid.) This behavior is manifested by the first three 
panels of fig. |8|, in which the contrast increases but the spatial pattern changes little on 
the scales resolved in this image. As a result, the power spectrum evolves as Pg(k,t) oc 
[a 2 (t) / a 2 (ti)]Ps(k , ti), where t% may be taken to be any time after recombination while 
the fluctuations are still evolving linearly. 

If Q = 1 and the initial power spectrum is a power law P$(k,tj) oc k n , then l c oc 
fl 2/(3+n) jf n > _3 ; t;h e initial fluctuation amplitude is small on large scales and large on 
small scales. The resulting behavior, with the comoving clustering length growing with 
time, is called hierarchical clustering. The same qualitative behavior results even if the 
initial power spectrum is not a power law, as long as k 3 Ps(k, t) grows with k. Such a field 
is not smooth — it is infinitely spiky on arbitrarily small scales. In ideal hierarchical 
models, the density field is nonlinear from the beginning on sufficiently small scales. The 
standard cold dark matter model (section |3.2| ) is a hierarchical model. 

The N-body simulation shown in fig. [| exhibits hierarchical clustering with initial 
density spectrum exponent n = 0. Our simple theory then predicts that the clustering 
scale should increase by factors of 2, 4, 6, and 10 for the last four outputs (a = 23, 64, 
125, and 250, respectively) relative to the first output (a = 8). This scaling is plausible 
visually from the mean spacing of typical dense clumps and it is borne out by a detailed 
analysis of the nonlinear power spectrum. 

In models where k 3 P$(k,ti) decreases as k —* oo, by contrast, small objects do not 
collapse first. Indeed, nothing collapses until k 3 Ps(k,t) « 1 first has a solution at t c on 
scale £ c = k~ x . The initial density field is smooth with a coherence scale £ c . A coherence 
scale can be built in by physical processes that suppress linear growth on small scales, 
such as the collisionless (free-streaming) damping that occurs if the dark matter has large 
thermal velocities (e.g., light massive neutrinos fL6"|). In this case the initial stages are 



described by the quasilinear Lagrangian theory of Zel'dovich [ 121 , 107], modified for the 



effects of shear and tides jT2). Historically such models were called "pancake" models 
after Zel'dovich's description of the generic shapes of the first objects to collapse. In 
realistic models, however, the power in the density field decreases on larger scales (this 
is required by fig. 0), so that pancake models at late times look like hierarchical models. 

Hierarchical clustering is complicated in detail because it involves the gravitational 
interaction of infinitely many degrees of freedom. Nevertheless, by applying simple scal- 
ing arguments similar to the one given above for the evolution of the clustering length, 
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cosmologists have been able to devise simple analytical theories for various properties 
of the nonlinear density field, such as the nonlinear power spectrum, the distribution of 
clumps by mass, the typical internal density profiles of clumps, the clustering properties 
of mass and clumps, etc. |)7|, 0, |, [57], |89f . 

One is tempted to identify the dense dark matter clumps formed by hierarchical 
clustering with the dark matter dominated halos surrounding galaxies. This idea is 
supported by high-resolution cosmological simulations including gas, which show that 
dense gas indeed collects in the dark matter clumps |7I], [72|, f42|. However, the dark matter 
clumps in most Q = 1 models appear to merge excessively compared with luminous 
galaxies. The discrepancy may plausibly be solved by the radiation of energy by shock 
heated gas, allowing the gas to sink toward the centers of the dark matter potential wells 

wm. 



3.2 The Cold Dark Matter Model 

The cold dark matter (CDM) model was the most popular specific model for cosmic 
structure formation during the 1980s |45|, |31|, [7j], ^(J. First proposed by Peebles [PHI , it 



soon replaced the pancake model with light massive neutrinos as the leading theory for 
structure formation ]TJ|, |30(| . The ingredients of the CDM model include the standard 
big bang theory plus nonbaryonic dark matter (denoted "X" and supposed to be some 
new elementary particle) with 

1. Qx = 0.95 and Qb = 0.05 (inflation predicts Q = 1, while primordial nucleosyn- 
thesis models favor small baryonic abundance); 

2. h = 0.5 (small Hubble constant, implying for fl = 1 a cosmic age of 13.2 billion 

years); 

3. scale-invariant {n = 1) gaussian isentropic density fluctuations. 



Before the measurement of the large scale microwave background anisotropy made last 
year, the CDM model had one free parameter, the normalization of the scale-invariant 
spectrum. Conventionally this was described, based on measurements from galaxy red- 
shift surveys |32|| , by the rms relative fluctuation <jg in mass in randomly placed spheres of 
radius 8 h~ l Mpc. If linear theory is used to compute <jg, it is simply related to the poten- 
tial amplitude A = A(k 0, £ ) of eqs. (0) and (0) by a 8 = 1.6 x 10 5 A Q . The COBE 
normalization for the standard CDM model implies linear a 8 = 1.05 ± 0.17 [[40], |105|| . 
Observations yield <7g = 1.0 for galaxies, but, as mentioned in section |2l|, it is possible 



that galaxies are more strongly clustered than mass, with a 8 (galaxies) = 6a 8 (mass) with 
b > 1. 

The CDM model became widely known after the first high-resolution N-body simu- 
lations of nonlinear gravitational clustering were published in 1985 by Davis et al. |3C 
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These authors concluded that one cannot simultaneously fit galaxy clustering and the 
relative peculiar velocities of galaxies (essentially the "temperature" of the galaxy dis- 
tribution) for any erg if b = 1. The essential problem is that galaxy thermal motions 
(small-scale relative velocities) grow too rapidly with er 8 so that when the clustering 
is sufficient, the galaxy distribution is far too hot. Davis et al. proposed a solution: 
set <t 8 = 0.4 (making the thermal velocities of galaxies acceptable) and require that 
the galaxies cluster more strongly than the mass by assuming that b = 2.5. Unless 
the anisotropy measured by COBE is mostly due to something other than the gravita- 
tional potential fluctuations of eq. (0), this model is now ruled out. However, clever 
theoreticians can find other ways to produce microwave anisotropy (e.g., deflection of 
the microwave radiation by long- wavelength gravitational radiation |ff5| , [34], [F7|), so the 
erg = 0.4 model is worth examining further based on its predictions for galaxy clustering 
and velocities. 

One problem with the biasing idea is that it appears ad hoc; why should galaxies 
cluster 2.5 times more strongly than dark matter? A higher- resolution N-body simulation 
in 1987 performed by White et al. [|1 1 7|| showed that some "bias" indeed arose naturally 
as a result of preferential formation of galaxies in dense regions. 

During the pre-COBE period 1986-1991, many authors concluded that the b = 2.5 
(og = 0.4) CDM model has too little large-scale power to account for large-scale clustering 
and motions (e.g., |13|, |103|| ). Achieving adequate large-scale power evidently requires a 



larger <7g. In retrospect this is not surprising, as fig. [7] shows that roughly consistent 
normalizations are implied by COBE and large-scale structure. 

In 1989, Carlberg & Couchman made the important point that galaxy velocities 



may be "biased" (relative to the mass) as well as the galaxy number density field |21 



Their simulations — with resolution comparable to the best previous calculations \\117\\ 
- showed a significant "velocity bias:" the temperature of the galaxy distribution was 
about four times less than that of the mass. Thus, if a high amplitude normalization is 
assumed (cr 8 = 1.0), the galaxy clustering may be strong enough without the small-scale 
peculiar velocities being excessive. However, this velocity bias does not appear to persist 
on the larger scales probed by POTENT discussed above in section 0|. 

More recent numerical work indicates that velocity bias is probably inadequate to rec- 
oncile the simulated velocities with observations |7I], |2S], |49j. Most workers therefore 
reject the CDM model. However, given that the simulations are limited in dynamic range 
and the treatment of physical processes, this conclusion should be considered tentative 
and subject to reexamination as simulations improve. 



3.3 Alternative Models 

The search for alternative models is guided by analysis of the problems of cold dark 
matter combined with measurements of large-scale structure (fig. [7]). The chief problem 
with the cold dark matter model appears to be excessive power on small scales. This is not 
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apparent in fig. 0, which appears to show a good match between the theoretical (solid) 
and measured (dashed-dotted) curves at large k. However, the theoretical curves are 
based on the linear power spectrum; N-body and analytical calculations for Q = 1 show 
that the nonlinear spectrum is enhanced by mode-coupling effects |65|. The excessive 



small-scale motions predicted by the theory imply the need to decrease the masses of 
galactic-scale clumps. This would also bring the theory better into agreement with 
observational estimates of the mass in galaxies and clusters 



113fl . 



Three ways have been suggested to decrease the masses of galaxies and clusters in the 
CDM model without sacrificing the relatively good agreement of the model with large- 
scale structure when normalized to the microwave background anisotropy [ |119| , [40], |112|| . 
The first is simply to decrease the mean mass density of the universe, parametrized by Q, 
so that there is less mass everywhere. Such models are further subdivided by whether or 
not they include a cosmological constant (Einstein's "blunder" resurrected ||) in order 
to make the cosmic curvature K (eq. [I]) vanish as predicted by inflation theory. Open 
universe models without a cosmological constant require fluctuations from a source other 
than quantum fluctuations during inflation, and such models are more complicated and 
uncertain than inflation ||109| , p8 |. 

Simulations of cold dark matter with Qx = 0.2 and the remainder made of a cosmo- 
logical constant have been performed by several different groups [41, 81, 111 , 24]. These 
authors find that the model looks very promising from the viewpoint of galaxy formation 
and clustering. However, as fig. |7| indicates, it is in conflict with measured large-scale pe- 
culiar velocities [ 106| |. Given the possible systematic errors of galaxy distance estimates, 
however, it may be prudent not to reject the theory without further examination. 

Another way to reduce galaxy masses is to "tilt" the primeval spectral index away 
from the scale- invariant slope n = 1, i.e., to modify the scale- invariant form of the 
primeval potential amplitude function A(k,t). Because the COBE measurement con- 
strains A at small k (fig. decreasing the small-scale power requires decreasing n. 
With less power on small scales, galaxies should be less massive. However, such a model 
must still be compatible with galaxy clustering and peculiar velocities. Unfortunately, 
even n = 0.7 |2^| still leads to excessive thermal motions of galaxies [131]. This problem 
can be ameliorated by decreasing n further or reducing the normalization of potential 
fluctuations by making some of the cosmic microwave background anisotropy with grav- 
itational radiation (refs. J75], |34|, ]77|), but then the models have too little power in the 
wavenumber range 0.01 ^ kMpc/h £ 0.1 (fig. [?]) ||106|| . Positive tilts {n > 1) are a 
nonstarter because they produce excessive small-scale structure. 

The third way potentially to repair the cold dark matter model is to replace some of 
the cold dark matter with a mass component that clusters less strongly. The cosmological 
constant model mentioned above is one extreme version, but a simpler method (from the 
viewpoint of fundamental physics) is to suppose that one type of neutrino has a very 
small mass, equivalent to a rest-mass energy of several eV (refs. |114| , [33], |112| , ff3|[ 
and references therein). Neutrinos were created in the big bang by thermal processes 



28 



but they effectively ceased interacting with the rest of the matter and radiation shortly 



before the era of primordial nucleosynthesis ||74|| . Since that time their comoving number 
density has been conserved and is comparable today with the number density of photons 
in the microwave background radiation. They have a mean thermal energy comparable 
with the mean energy of microwave background photons. In the past the thermal energy 
per neutrino was high; consequently, massive neutrinos are often referred to as hot dark 
matter. There are three types of neutrinos (electron, muon, and tau), but it is likely 
that at most one of them (presumably the r) has a cosmologically interesting mass. A 
neutrino of mass 97 h 2 eV would close the universe by itself with Q u = 1. 

The thermal velocities of neutrinos cause them to cluster less strongly than cold dark 
matter: gravity cannot confine hot dark matter in a shallow potential well. If there 
is no cold dark matter at all, then the massive neutrinos stream out of the potentials, 
which are thereby erased. The resulting model has too little power on small scales and, 
as mentioned in section EO, was rejected before cold dark matter became popular. If, 
however, the dark matter is an admixture of hot and cold dark matter, perhaps one 
can adjust Q u (which is proportional to the neutrino mass) so as to suppress small-scale 
clustering sufficiently while retaining the successes of the COBE-normalized cold dark 
matter model for large-scale structure. 

The mixed dark matter model with Q u = 0.3 and h = 0.5 (requiring a neutrino of 
mass 7 eV) has been studied recently by several groups with N-body simulations |33|, [73 



and gravity plus gas dynamics pTfl . The first two groups concluded that this model looks 
very promising, while the latter authors find that the model is unsatisfactory because the 
suppression of small-scale power is inadequate to solve the small-scale clustering problems 
of CDM but at the same time too much small-scale power is removed to form galaxies 
sufficiently early. Because all of these simulations are based on moderate-resolution grid 
methods that fall far short of the dynamic range desired for realistic simulations, their 
conclusions should be considered highly tentative. 

To test models with resolution adequate to study the formation and clustering of 
galactic halo-sized dark matter clumps, I have performed large N-body simulations of the 
CDM, CDM plus cosmological constant, and mixed dark matter models. The standard 
CDM (Q — 1 and h = 0.5) simulation has 144 3 particles and is described and analyzed in 
1~5]. The simulation required 770 IBM 3090 hours to evolve to a = 1.0 (the present time) 
with 1200 timesteps. The model with a cosmological constant has flo — 0.2 and h = 0.8, 
with 128 3 particles, and required 250 Cray Y-MP hours to evolve to a = 1.2 with 2061 
timesteps. The mixed dark matter model has 128 3 cold and 10 x 128 3 hot particles with 
initial conditions (for Q u = 0.3 and h = 0.5) generated as described in []8"0|1 . It required 



500 Convex C-3880 hours to evolve to a = 0.5 with 265 timesteps. In all cases, a = 1.0 
corresponds to the COBE normalization. All simulations (as well as the one shown in 
fig. ^|) were performed using the adaptive mesh-refined particle-particle/particle- mesh 
algorithm |29j, [49], had spatial resolution smaller than 10 -3 of the simulation size, and 
conserved energy to a fraction of a percent. 
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Figure 9: The projected mass density for four different models, all in a cube of comoving 
length 50 h~ x Mpc at a redshift z — 1 (when the universe was about one-third its present 
age). The upper left panel is cold dark matter with linear a 8 = 0.5. The upper right panel 
is the same model at the smaller amplitude linear <jg = 0.2. The lower left panel is cold 
dark matter with Qq = 0.2 plus a cosmological constant added to cancel the background 
spatial curvature. The lower right panel is mixed (hot plus cold) dark matter with 
Q u = 0.3. All models except the upper right one are normalized to COBE as in fig. 7. 
Colors represent the logarithm of the projected mass density ranging from 1 to 20 times 
the cosmic mean. 
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Figure^ compares the mass distributions in these models at redshift z = 1. One sees 
that both the abundance and clustering of dense dark matter clumps varies widely among 
the models. The clumps in the CDM model grow significantly between og = 0.2 and 0.5 
(top two panels), by which time there are already too many massive clumps to represent 
galaxy halos ||49|| . The large-scale clustering in this model is rather weak. When Q 



is decreased to 0.2 (bottom left panel), the large-scale structure increases significantly 
and the numbers of dense clumps becomes more reasonable. Although the most massive 
clumps have larger density contrasts than in the CDM model, their masses are smaller 
because the model has 5 times less mass overall (Qq = 0.2). Finally, it is evident that 
the mixed dark matter model with Q u = 0.3 has much less nonlinear structure at z — 1 
than the other models. The low abundance of dense dark matter clumps suggests that 
this model may not form enough objects to match observations of galaxies and quasars 
at high redshift. Quantitative analysis of these simulations will be published elsewhere. 



4 Conclusions 

We have divided the whole of cosmic structure formation theory into three broad areas: 
(1) homogeneous big bang cosmology, (2) large-scale structure, and (3) galaxy formation. 
This division may be regarded as a sequence of physical scale — ranging from the greatest 
distances that we can possibly see, to the merely astronomically large — or as one of 
complexity. On the largest scales (greater than 10 27 cm) the universe appears remarkably 
uniform, yet on much smaller scales the universe is richly textured, with virtually all of 
the luminous matter concentrated into objects — galaxies — a million times denser than 
the cosmic mean. The major challenge facing cosmologists is to account for this spatial 
progression from order to disorder. 

It is clear that cosmic structure formation is not going to be explained by a single sim- 
ple physical theory in the way that elementary particle interactions are explained by the 
standard model of particle physics. A hierarchy of theoretical models is required, with 
fundamental physics — a theory of gravity and spacetime plus theories of the behavior 
of matter and radiation at extreme energies in the early universe — required for the uni- 
verse as a whole, progressing to theories, in effect, of gravitational and hydrodynamical 
turbulence on galactic scales and below. However, if this subject is to be a hard science, 
it must have theories that make specific predictions that can be tested by data. In this 
article we have given an overview of these theories: general relativity, the Robertson- 
Walker spacetime models, gravitational instability, and nonlinear gravitational and gas 
dynamics. 

How do these theories stack up against observations? On the largest scales, the 
agreement between the predictions of the big bang theory and four different types of 
observations is extremely good (section [I]). The remarkably accurate measurements of 
the spectrum and isotropy of the cosmic microwave background radiation made by the 
COBE satellite strengthen an already solid foundation. No crises have shaken the big 
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bang theory lately (occasional news reports notwithstanding) and no alternative theory 
has appeared as a serious rival. However, the nature, amount, and distribution of dark 
matter remain important outstanding questions. 

What about theories of large-scale structure? The prevailing paradigm is that struc- 
ture evolved as a result of the gravitational instability of the homogeneous big bang 
model, seeded by some source of small-amplitude fluctuations of very large scale (rang- 
ing in wavelength from at least 10 22 to 10 28 cm). As we have seen, the gravitational 
instability paradigm predicts relations for three dynamical fields — the gravitational po- 
tential, the velocity, and the density — that can be tested by combining measurements 
of cosmic microwave background anisotropy, galaxy motions, and galaxy clustering. A 
major success has occurred during the last two years, as data in all these areas are being 
combined (fig. |7|) to test our paradigm of large-scale structure. Although the measure- 
ment uncertainties are still large and difficult to quantify, the good agreement (better 
than a factor of two over three decades of wavelength) has given strong encouragement to 
cosmologists to pursue more detailed modeling. The observational situation is expected 
to improve further during the next few years as major projects are underway in all areas 
of large-scale structure. 

On smaller scales, however, no theory presently stands out as a clear leader, in large 
part because of the uncertainty in the nature, amount, and distribution of dark matter. 
This is a reversal of the situation five years ago, when the cold dark matter theory was 
favored by many cosmologists. High resolution computer simulations combined with 
improved observations have raised serious problems for the cold dark matter model. In 
fact, the demise of this theory is a mark of progress, showing that theorists can usefully 
calculate the consequences following from a few precepts of early universe cosmology, 
with enough accuracy to be contradicted by observations. Indeed, we now have strong 
guidance as to how the model should be changed: the small-scale power (or the masses 
of galaxy halos and clusters) must be diminished while retaining or slightly boosting the 
large-scale structure needed for COBE, peculiar velocities, and large-scale clustering. 

It is no mark of shame that, even with this guidance, we do not yet have a stan- 
dard model of galaxy formation to replace cold dark matter. Any other theory is more 
complicated — cold dark matter has virtually no free parameters — and the recognition 
of the importance of large-scale structure has boosted the dynamic range requirements 
to a level straining our present computational capabilities. Future tests will rely heav- 
ily on supercomputer simulations of nonlinear gravitational clustering and complex gas 
dynamical interactions. Cosmic structure formation is a grand challenge computational 
problem of the physical sciences. Given the rate of progress in this field, I am optimistic 
that before the end of this decade we will have a well-tested standard model of galaxy 
formation. 
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